Updating Sets of Probabilities 
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Abstract 



There are several well-known justifications 
for conditioning as the appropriate method 
for updating a single probability measure, 
given an observation. However, there is a 
significant body of work arguing for sets of 
probability measures, rather than single mea- 
sures, as a more realistic model of uncer- 
tainty. Conditioning still makes sense in this 
context — we can simply condition each mea- 
sure in the set individually, then combine the 
results — and, indeed, it seems to be the pre- 
ferred updating procedure in the literature. 
But how justified is conditioning in this richer 
setting? Here we show, by considering an 
axiomatic account of conditioning given by 
van Fraassen, that the single-measure and 
sets-of- measures cases are very different. We 
show that van Fraassen's axiomatization for 
the former case is nowhere near sufficient for 
updating sets of measures. We give a con- 
siderably longer (and not as compelling) list 
of axioms that together force conditioning in 
this setting, and describe other update meth- 
ods that are allowed once any of these axioms 
is dropped. 



1 INTRODUCTION 

A common criticism of the use of probability theory 
is that it requires the agent to make unrealistically 
precise uncertainty distinctions. One widely-used ap- 
proach to dealing with this has been to consider sets of 
probability measures as a way of modeling uncertainty 
(see, for example, [Breese and Fertig 1991; Gilboa 
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and Schmeidler 1993; Huber 1981; Kyburg 1974; Levi 
1980; Smith 1961]). Given that one adopts the sets- 
of-measures model, how should one update these mea- 
sures in the light of new evidence? There is an "ob- 
vious" approach available, which is to apply standard 
probabilistic conditioning to each of the measures in 
the set individually and then combine the results. It is 
typically taken for granted that this is the appropriate 
thing to do (see, for example, [Cozman 1997]). But 
what justifies this approach? 

There have been numerous attempts to justify condi- 
tioning as the appropriate way to update single prob- 
ability measures. The standard approach involves 
Dutch Book arguments [Kemeny 1955; Shimony 1955; 
Teller 1973]. However, these arguments have not 
always been viewed as so convincing; see Bacchus, 
Kyburg, and Thalos [1990] and Howson and Urbach 
[1989] for a summary of these arguments and some 
counterarguments against them. In any case, even 
if we accept the standard justifications for condition- 
ing, there is no a priori reason to believe that they 
must also apply to the sets-of-measures case. In fact, 
they may not, and demonstrating this is a major point 
of this paper. We focus here on a different, yet sim- 
ple and compelling defense of (ordinary) conditioning, 
due to van Fraassen [van Fraassen 1987; Hughes and 
van Fraassen 1985]. Van Fraassen considers two sim- 
ple and arguably quite reasonable properties that we 
might demand of an update process and shows that 
conditioning is the only mechanism that satisfies these 
properties. We show that these properties are not suf- 
ficient in the sets-of-measures case. Indeed, there are 
numerous other update mechanisms that satisfy them. 
We also show that, by postulating enough extra prop- 
erties, we can recover conditioning as the unique so- 
lution; however, the properties we seem to need are 
far less compelling than those required for the original 
result. 

We begin with an informal description of van 
Fraassen's result. He wants to examine arbitrary ap- 



proachcs for updating probabilities in the light of now 
evidence. Thus, he considers a function upd (for up- 
date) that takes two arguments — a probability mea- 
sure Pr on a domain W and a subset B C W and 
returns a new probability measure upd(Pr, B), which, 
intuitively, is the result of updating Pr by the evidence 
B. It certainly seems reasonable to require 

if Pr' = upd{Pr, B), then Pr'(B) = 1. (1) 

That is, after updating, we should ascribe probability 
1 to the evidence we have obtained. 

Another reasonable principle to require is what van 
Fraassen calls symmetry. We can also think of it as 
representation independence, in the sense of [Halpern 
and KoUer 1995]. Intuitively, suppose we represent a 
situation using the worlds in W' rather than those in 
W. Let / transform W to W; there is a correspond- 
ing transformation /* of a probability Pr on W to 
a probability /*(Pr) on W' . Then we would expect 
upd to respect this transformation. Roughly speaking, 
this means that if Pr' = upd(Pr, B), then we would 
expect /*(Pr') — upd{f*(PT), f{B)). More precisely, 
HB CW' and Pr' = upd{PT, f~^{B)), then we would 
expect /*(Pr') = upd{f*{Vv),B). The formal defini- 
tion is given in Section 3, but an example here might 
help explain the intuition. 

Example 1.1: Consider two agents who are reason- 
ing about a given situation. One uses the primitive 
propositions p, g, and r; the second uses p and q. Let 
W consist of the eight truth assignments to p, q, and 
r, and let W consist of the four truth assignments 
to p and q. We take the eight worlds in W to be of 
the form Wijk, i,j,k G {0,1}, where i, j, and k give 
the truth value of p, q, and r, respectively. Thus, for 
example, in wioi, p and r are true while q is false. 
Similarly, we take the worlds in W to have the form 
w'-j . We now consider the obvious mapping / from W 
to W that maps Wijk to w'^j. Given a measure Pr on 
W, f induces a measure /*(Pr) on W in the natu- 
ral way, i.e., by taking /*(Pr)(w^) = Pr({wijo, 
We want it to be the case that our updating rule re- 
spects this transformation. In this case, that would 
mean that f*{upd{ViJ-^{B))) = upd.{f* {Pr), B) for 
each B C W'. Thus, for example, we would have 
f*{upd{Pi,{wijo,Wiji}) = upd{f*{Pv),{wij}). I 

Van Fraassen showed that the only updating rule that 
is representation independent in this sense and satisfies 
(1) is conditioning. 

We can apply van Fraassen's approach to the sets-of- 
measures case in a straightforward way. We again con- 
sider update functions that take two arguments, but 
now the first argument is a set of probability mea- 
sure rather than a single probability measure, and 



the output is also a set of probability measures. Van 
Fraassen's two postulates have obvious analogues in 
this setting (which we formalize in Section 3). How- 
ever, as we show by example, they are no longer strong 
enough to characterize conditioning. 

One interesting update function that satisfies both 
conditions is what Voorbraak [1996] has called con- 
straining. This updating function is defined as fol- 
lows: given a set X of probability measures and an 
observation B, it returns all the measures in X that 
assign B probability 1. That is, the observation B is 
viewed as placing a constraint on the set of probability 
measure — namely, that B must be assigned probabil- 
ity 1. We then return all the measures in the set that 
satisfy this new constraint. Voorbraak argues that 
constraining is actually more appropriate than con- 
ditioning when it comes to capturing a probabilistic 
analogue of the notion of expansion in the AGM [Al- 
chourron, Gardenfors, and Makinson 1985] theory of 
belief change (where expansion is how beliefs change 
when we get extra information that is consistent with 
previously- held beliefs). 

In Section 3, we provide seven postulates on update 

functions that suffice to guarantee that an update 
function on sets of measures acts like conditioning. Be- 
sides van Fraassen's postulates, our postulates include 
a "homomorphism" postulate, which says that the re- 
sult of updating a set X of measures is the union of 
the result of updating each element in X separately, 
and a compositionality postulate, which says that up- 
dating by B and then by C is the same as updating 
hy B (1 C (and hence also the same as updating by 
C and then by B). We also include a postulate that 
limits the amount by which the post-update probabil- 
ity of an event can exceed the value it would obtain 
under conditioning. The intuition for this is that an 
extremely improbable event should not receive a post- 
update probability that is "too large" . The postulate 
that we use to capture this intuition is arguably too 
strong; it is an open problem to what extent it can be 
weakened. 

Although our postulates are quite strong, we show that 
no subset of them suffices to force conditioning. Inter- 
estingly, if we drop our last postulate, then there are 
exactly two update functions that are consistent with 
the remaining six: conditioning and constraining. 

We believe we arc the first to try to axiomatize the 
updating of sets of probability measures, but others 
have certainly examined the issue of updating other 
notions of uncertainty that are related to sets of prob- 
ability measures. Besides the work of Voorbraak cited 
above, we briefly mention three other lines of research: 

• It is well known that a Dempster-Shafer be- 



lief function Bel [Shafer 1976] can be associ- 
ated with the set of probability measures that 
dominate it, that is, the set VbsI = {Pr : 
Pt{A) > Be\{A) for all A}. In fact, Bel(A) = 
infprePj5^i Pr(74). "^^Y defining the up- 

date of Bel by a set B, considered in [Fagin and 
Halpern 1991; Jaffray 1992], is to take Bel(-|B) = 
infprePjj^i Pr(A|i?). This approach to updating 
is quite different from Dempster's Rule of Con- 
ditioning [Shafer 1976]. (See [Halpern and Fagin 
1992] for a discussion of the differences.) Moral 
and de Campos [1991] consider yet other ap- 
proaches to updating belief functions. 

• Gilboa and Sclmieidler [1993] consider update 
rules for non-additive probabilities (of which be- 
lief functions are a special case, as are convex sets 
of probability measures). They show that under 
certain assumptions, the maximum-likelihood up- 
date rule is equivalent to Dempster's Rule of Con- 
ditioning. We discuss these results in more detail 
in Section 3. 

• Walley [1991] has a theory of lower and upper 
previsions based on gambles and considers an ap- 
proach to updating previsions called the gener- 
alized Bayes rule, which, as the name suggests, 
generalizes standard conditional probability. Sets 
of gambles can be associated with (convex) sets 
of probability measures; Moral and Wilson [1995] 
consider approaches to revising closed sets of gam- 
bles given another gamble^ and relate their ap- 
proaches to the AGM postulates. 

The rest of this paper is organized as follows. In Sec- 
tion 2 we define update functions carefully and give 
some examples of them. In Section 3, we state our pos- 
tulates. In Section 4 we outline the proof of our main 
result, which is that our postulates characterize con- 
ditioning. Despite the strength of our postulates, our 
proof is surprisingly difficult, which is perhaps further 
evidence that quite strong postulates are necessary to 
characterize conditioning in the sets-of-measures case. 
We conclude in Section 5. 

2 UPDATE FUNCTIONS 

The general framework we work in is a straightforward 
extension of van Fraassen's. Suppose we have a mea- 
sure space M = {W,!F), that is, a domain W and an 
algebra over W."^ Let Aj^ consist of all probability 

^ Since events can be viewed as a special case of gam- 
bles, this is a more general notion of updating than that 
considered here. 

^An algebra T over 14^ is a sot of subsets of W that in- 
cludes W and is closed under complementation and union. 



measures over A4 . An update function on is a func- 
tion Upd : 2^^ X 2^^, such that Upd{X, B)=% 
if Pr(B) = for all Pr e X.^ That is, Upd takes as 
input a set of probability measures over M. and an 
element of J^, and returns a set of measures over M. . 
Intuitively, Upd{X, B) consists of the result of updat- 
ing the measures in X by the observation B. li X 
is the singleton set {Pr}, we write Upd{PY,B) rather 
than Upd{{PT},B). Note that for us, however, unlike 
for van Fraassen, Upd(Pr, B) is a set of measures (pos- 
sibly empty), not a single measure. 

Van Fraassen's symmetry requirement (i.e., represen- 
tation independence), considers not one update func- 
tion, but two, acting on different domains, and re- 
lates their outputs. Thus, we are interested in families 
Upd/^ of update functions, one for each measure space 
M . We use Upd as a way of denoting the whole family 
{Upd^}.^ 

As defined, families of update functions can be com- 
pletely arbitrary. They can act like conditioning in 
one space and return a fixed probability measure in 
another. We now give examples of seven families of 
update functions that are not completely arbitrary, in 
that they satisfy a number of properties of interest to 
us, although in some cases their behavior is quite far 
from conditioning. 

. Upd^^^iX, B) = {Pr(.|S) : Pr G X, Pr(B) > 0}. 

UP'^cond is the standard update via conditioning. 
More precisely, we condition when possible; we 
simply discard those probability measures Pr e X 
such that Pr(B) = 0. 

• Upd-^nstrain{X,B) = {Pr G X : Pr(B) = 1}. 

^P'^constrain j^^t Voorbraak's [1996] notion of 
constraining, as discussed in the Introduction. 
Note that f/prf^„«t™„(A, B) = if A contains 
no probability measures Pr such that Pr(i?) = 1. 

. Upd^,g,^{X, B) = {Pr e Am : Pr{B) = 1}. 

so that if A, B £ X, then so arc A and Au B. In the 
case that W is infinite, we could also require that .7^ be a 
a-algebra, that is, closed under countable union. None of 
our results would change if we made this requirement. 

^The final condition in this definition is analogous to 
the conventional restriction that one cannot condition on 
a measure event. It is well known that the problem of 
defining a sensible notion of "update" for measure events 
is a nontrivial one, oven in the the conventional (single 
measure) framework. However, this problem is (largely) 
orthogonal to the topic of this paper. Note that this con- 
dition implies that Upd{0, B) = 0. 

^Readers concerned about cardinality considerations 
should think in terms of restricting to domains that have 
at most a certain cardinality, such as the cardinality of the 
reals. 



With Updjgj,g^i, wc ignore the information in X al- 
together. While this may seem to be a completely 
uninteresting update function, note that it can be 
viewed as modeling an agent who learns B, but 
then forgets what he knew before (which we can 
think of as being encoded by X). It points out the 
role of "no forgetting" in conditioning, an issue to 
which we return below. 

Upd^,„JX,B)=t 

We have already seen that, in general, we may 
have Upd{X,B) = even if X 7^ 0. With 
Updfriviab take this one step further and have 
the output be the empty set independent of X 
and B. While this is clearly a rather uninterest- 
ing update function, we must be careful about 
what requirements to impose to ban it, so we do 
not ban too much. 

Upd-^,,^,,{X,B) = Upd^,^^{X'^,B) = ({Pr(-|i?) : 
Pr e X^Pr(B) 7^ 0}), where X" denotes the 
topological closure of X , that is, X'^ consists of 
all measures Pr such that for all e > 0, there ex- 
ists a measure Pr' e X such that sup^g^p | Pr(^) — 
Pr'(A)| < e. 

Updciosure shows that update functions can take 
topological conditions into account. Note that 
Upd^ond ^iid Upd^igg^^g agree on all finite sets X. 
The difference between them only arises if their 
first argument is infinite. For example, suppose 
M2 = ({1,2},2{1'2}) and X = {Pr e Aa^, : 
Pr({2}) < 1}. Then f/pd^^,(X {1, 2}) = X, 
while C/pd^„^,„„(X,{l,2}) = since = 

^M2- every probability measure in not in 

X must give {2} probability 1, and can be ap- 
proximated arbitrarily closely by a measure in X. 

UpdZ,set{^r,B) = 

{Pr(- |C ^B)■.C <^W, Pr(C A > 0} 

if Pr(B) < 1 

{Pr} if Pr(B) = 1 

UpdZ,set{X, B) = Up.eX UpdZbsetiP^, B). 

Intuitively, if Pr(B) < 1, then UpdZubset(P^ ^ B) 
amounts to conditioning on all events we could 
learn in addition to B. We treat the case that 
Pr(i?) = 1 specially, to ensure that Upd^^i^^^^ sat- 
isfies one of the postulates we consider. For ar- 
bitrary sets X of probability measures, we apply 
Updgy^bset pointwise (and then take unions). Our 
interest in Upd^^^set motivated by the fact that 
it satisfies many natural properties while being 
quite difl^erent from Upd^^^^^ and Upd^„„,t^^,„. 

UpdZ,L{X,B) = {Pr(.|B) : Pr € X,Pr(B) > 
0,Pr(B) =supp,,exPr'(5)}. 



UpdML is maximum likelihood rule considered 
by Gilboa and Schmeidler [1993] (except that they 
restrict to the case that X is a closed convex set, 
which guarantees that there is some Pr G X such 
that Pr(i?) = suppr/£j(^ Pr'(i?)). It is an instance 
of what they call a classical update rule, which is 
one of the form Upd^{X,B) ^ {Pr(-|B) : Pr G 
X' C X}, for some appropriately chosen X' . Note 

that Upd^g^^, t'prfconstramJ Updt„„ah and J/prf^L 

are all classical update rules in this sense. 
3 THE POSTULATES 

What properties should an update function have? We 
want to start by imposing the two properties consid- 
ered by van Fraassen. The first is easy to formalize in 
our framework. 

PI. Upd-^{X, B) C {Pr G A^ : Pr(S) = 1} 

That is, if we learn B, we want to assign probability 1 
to B. Notice that all seven update functions described 

above satisfy this postulate. 

To define the second postulate (i.e., representation in- 
dependence) carefully, we review some material from 
[Halpern and KoUer 1995]. What does it mean to shift 
from a representation (i.e., a measure space) M. = 
{W,T) to another representation M.' = {W',J^')1 
There arc many ways of shifting from one represen- 
tation to another. For us, it suffices to consider what 
is perhaps the simplest case, where each world in W is 
associated with several worlds in W. We can think of 
representation W as being richer than representation 
W, in the sense of using additional primitive proposi- 
tions or random variables to describe a world. This is 
the situation in Example 1.1, where in W we used three 
primitive propositions to describe a world, whereas in 
W we used only two. We can then associate with each 
world in W' all the worlds in W that agree with it on 
all the primitive propositions it uses. Formally, this 
association is captured by a surjective map from W to 
W. 

Definition 3.1: A representation shift from A4 = 
{W,T) to M' ^ iW,T'), also called an M M' rep- 
resentation shift, is a measurable surjective map from 
W to W, that is, a surjection f : W W such 
that f^^{B) G J- for all B ^ T' (where, as usual, 
f-\B) = {xeW:f{x)eB}). I 

As is well known, /^^ is a homomorphism with respect 
to unions and complementation, that is, f~^{BUB') = 
f-^{B) U f^HB') and f-\B) = f-^{B) (where we 
use U to denote the complement of U). The fact that 



/ is surjcctivc also makes / ^ 1-1.''' An M -M' repre- 
sentation shift also induces a map /* : ^M''i 
we define (/*(Pr))(A) = Pr(/-i(A)).6 Finally, if 
X C Aa<, then we define f*{X) = {/*(Pr) : Pr e X}. 

With these definitions, wo can formally state van 
Fraassen's representation independence property. In- 
tuitively this says that, so long as we are updating by 
an event in J^' , it should not make any difference if 
we are fact working in a space M. that is capable of 
making finer distinctions than does A4'. Another con- 
sequence is that the "labels" attached to points cannot 
affect how we update measures. 

P2. Let / be an A4-M.' representation shift. If 

X CAm a^ndB e T' , then Upd^' {f*{X),B) = 
nUpd^{XJ-\B))). 

As we said in the introduction (and will also follow 

from the results in Section 4), van Fraassen showed 
that if we consider update functions upd from proba- 
bility measures to probability measures (rather than 
from sets of probability measures to sets of probabil- 
ity measures), then PI and P2 (appropriately modified 
to deal with upd rather than Upd) suffice to guaran- 
tee that upd is conditioning. However, it is easy to 
see that all seven of the update functions described in 
Section 2 satisfy both PI and P2. 

Can we impose other reasonable properties that re- 
strict the set of allowable update functions? One prop- 
erty of conditioning is that order does not matter. Up- 
dating by B and then C is the same as updating by 
C and then _B, and both arc the same as updating by 
BOC. This property does not follow from PI and P2 
in our more general setting. Updj^^^^^ provides a coun- 
terexample: if we update by B and then by C using 
Updfg^ggf, we get all the probability measures that give 
C probability 1; if we update by C and then B, then 
we get all the probability measures that give B prob- 
ability 1; if we update by S n C, we get all probability 
measures that give BUC probability 1. Thus, we add 
the requirement that updates commute to our list of 
properties as well. 

P3. Upd-^ ( Upd^ {X, B),C)= Upd-^ {X, BnC) 

Although P3 is a standard property of conditioning, it 
is far from innocuous. It can be viewed as encoding an 

assumption of "no forgetting" . Intuitively, in order for 
updating by B and then C to be the same as updating 

^In the language of [Halpern and Koller 1995], if / is an 
M-M' representation shift, then is a faithful M'-A4 

embedding. 

''/* is what van Fraassen calls a measure embedding. 



by B O C, the agent must remember the information 
in B when he is updating by C. 

It is easy to see that P3 is not satisfied by either 
Updfg^^ggf. or Updf^i^, although it is satisfied by the 
other update rules defined in Section 2. As observed 
in [Fagin and Halpern 1991; Jaffray 1992], the up- 
date rule for belief functions define by Bel(-|i?) = 
infprgT^g^i Pr(^|i3) also does not satisfy P3.^ On the 
other hand, Dempster's Rule of Conditioning does sat- 
isfy P3. Gilboa and Schmeidler [1993] provide suffi- 
cient conditions on X that guarantee that Updj^^j^ sat- 
isfies P3.® Moreover, they show Updj^j^^X, B) acts like 
Dempster's Rule of Conditioning for those sets X that 
satisfy these conditions. 

We clearly need further postulates to rule out functions 
besides Updj^^ and Upd^g^^g^f. The next postulate says 
our beliefs don't change if we learn information that 
we expected to be true all along (i.e., which was given 
probability 1 by all measures in our current set). This 
suffices to rule out Upd^^^i^i- 

P4. Upd^{X, B)=X if Pr(B) = 1 for all Pr e X. 

It is easy to see that P4 is satisfied by Updg.^i,ggf as 
weU as Upd^g,^^ and Upd,.g^^i^^„^. (Note that the spe- 
cial treatment of Updg^i,^^^{Pr, B) in the case that 
Pr(i?) = 1 was necessary to ensure this.) Although 
P4 is not satisfied by Upd^ig^^^.^, we could modify 
Updciosurei^^ B) in the special case that Pr(i3) = 1 
for all Pr e X so that it does satisfy P4. We thus need 
a stronger condition to rule out update functions such 
as Upd^igg^^^. The next postulate, which says that the 
action of an update function on a set of measures is 
determined by its action on the individual members of 
the set, does that. 

P5. Upd^{X, B) = Uprex Upd^{Pr, B). 

As we have seen, Upd^ig^.^^^ does not satisfy P5; it acts 
like Upd^g^j^ on finite sets, but disagrees with Upd^^.^^^ 
in general on arbitrary sets. Updj^j^ does not satisfy P5 
either. It might seem that once we force the behavior 
of an update function to depend only on its behavior 
of singletons, we should be able to appeal immediately 
to van Fraassen's result. This, however, is not true, 
because Upd^(Pv,B) can still be an arbitrary set of 

Jaffray [1992, Corollary 2] characterizes the restricted 
circumstances when P3 holds for this update rule. 

'^Suppose that X consist of a set of probability measures 
onM = (W,J^). Let fx{A) = min{Pr(yl) : Pr e X} for 
A S The Gilboa-Schmeidler conditions say that (1) X 
is convex, (2) X = {Pr € Am ■ Pr(A) > fx{A)}, and (3) 
fxiA\JB) + fx{AnB) > fx{A) + fxiB)Jor&\\ A,BeJ'. 
See [Gilboa and Schmeidler 1993] for the motivation for 
these conditions. 



probability measures. All of the update functions in 
our examples other than Upd^i^^^^^ and Updj^j^ satisfy 
PI, P2, and P5. 

It might seem that P5 gives too special a role to the 
action of Upd on singleton sets. This is not in keeping 
with the spirit modeling uncertainty by arbitrary sets 
of probability measures. We can rewrite P5 to avoid 
mention of singleton sets by requiring instead that Upd 
commute with arbitrary unions, that is, 

Upd^{Uj^jXj,B) = UjejUpd^{Xj,B), 

where J is an arbitrary index set. It is easy to see 
that this postulate is equivalent to P5 while not giv- 
ing a special role to singleton sets. We wrote P5 as we 
did because in fact we need it only for singleton sets. 
Note that it is important to allow the index set J to 
be arbitrary here. The perhaps more appealing pos- 
tulate Upd^ (X UY,B)= Upd-^ [X, B) U Upd-^ {Y, B) 
(which can be extended by induction to show that Upd 
commutes with finite unions) is not strong enough to 
eliminate Upd^i„,^^^. 

All of the properties we have considered so far are sat- 
isfied by Updg^i,g,,f. We consider here two ways of elim- 
inating Updgy^g^^. Neither is as clean as we would like. 
After introducing the postulates, we discuss possible 
alternatives. 

One way of eliminating Upd^^j^^^^ is to require that on 
a singleton argument, Upd returns either a singleton 
or the empty set.^ More precisely, we have 

P6'. \Upd-^{Pv,B)\ < 1. 

P6' again puts more emphasis on singleton sets than 
we would like, and seems somewhat strong. A quite 
different approach to eliminating Upd^^^^^^ is based 
on the observation that it is not "continuous": ev- 
ery event in B that has nonzero probability according 
to Pr (including ones whose probability is negligible) 
will be given full belief (probability 1) in at least one 
of the post-update measures, while the rest of B (per- 
haps containing almost all of iJ's probability accord- 
ing to Pr) is given probability 0. Perhaps there should 
be some limit on how much the probability of a small 
event can increase. The next postulate ensures this, by 
requiring an upper bound on the post-update proba- 
bility relative to the original conditional probability. 

P6". For all M = {W, J^) and Pr e ^m, there exists a 
constant c such that for all A,B & T and Pr' e 
Upd^ (Pi, B), we have Pr'(A) < cPr(^|B). 

^Wc must allow it to return an empty set since 
Upd-^ {Pr, B) is required to be if Pr(_B) = 0. 



Clearly P6" suffices to eliminate Upd^^i^g^,). However, 
it does not seem as natural as our other assumptions. 
Even if we accept the need for a continuity postulate, 
there seem to be weaker and more natural formaliza- 
tions of it. In fact, the following postulate seems to 
assert continuity more directly: 

P6*. For all M = {W,T), Pr e A,v(. B e and 
e > 0, there exists (5 > such that ii A £ T and 
Vt:{A\B) < 5, then Pr'(A) < e for every Pr' e 
Upd^{Pr,B). 

P6* also suffices to rule out Upd^^^f^g^f. However, we 
have not been able to prove that PI P5 and P6* force 
Updcond and f/pdconstram- Replacing P6* by P6' or P6" 
does the trick though. Let P6 state that either P6' or 
P6" holds. 

P6. Either P6' holds for all measures spaces A4 or P6" 
holds for all measure spaces A4. 

The main result of this paper, proved in the next sec- 
tion, is that Upd^„^^ and Upd^„^^t^^i„ are the only up- 
dating functions that satisfy P1-P6. 

What is the relationship between P6', P6", and P6*? 
It is easy to see that P6" implies P6*: given B, e, 
and c as in P6", we can take 6 < ePr(S)/c. As we 
show in the appendix, P6' together with PI and P2 
implies both P6* and P6". Finally, it follows from 
the main result of this paper that P6" together with 
P1-P5 implies P6' and P6*. 

Once we are down to f/prf,„„^ and f/pdco„,t„,„, it is 
easy to add another postulate to get just Upd^^^g^^g^^^. 
The following weak postulate suffices: 

P7. There exists some measure space A4 = {W,J^), 
some set X C Am , and some set B e ^ such that 
Pr(S) ^ 1 for all Pr e X and Upd^{X, B) ^ 0. 

It should be clear that Upd^^^^ satisfies P7, while 

Upd^onstrain doCS not. 

4 THE MAIN THEOREM 

The main result of the paper is the following. 

Theorem 4.1: The only update functions that satisfy 
P1-P6 are Upd^„„a and Upd^^^.t^^i^. 

The following corollary is then immediate: 

Corollary 4.2: The only update function that satisfies 
P1-P7 IS Upd,,„,. 



In this scx'tion. wc give a high-level outline of the proof 
of Theorem 4.1. Further details of the proof are de- 
ferred to the appendix. We omit the proof of some 
of the more technical and difficult lemmas because of 
limited space. 

It is worth beginning with the following lemma, of 
which van Fraassen's result is a corollary. Roughly 
speaking, it says that the post-update probability we 
give to an event cannot be consistently smaller than 
its conditional probability. 

Proposition 4.3: Suppose that Upd satisfies PI and 
P2 and that for some Pr' e Upd!^ {Y>v,B) and A such 
that < Pr(A|B) < 1, we have Pr'(A) < Pr(A|B). 
Then there also exists some Pr" G Upd^ (Pr, B) such 
that Vr"{A) > Pr{A\B). 

Proof: See the appendix. | 

Van Fraassen's result follows almost immediately from 
Proposition 4.3, as the following result shows. 

Proposition 4.4: // Upd satisfies PI and P2, and 
Upd-^{Pv,B) = {Pr'}, then Pr' = Pr(-|B). 

Proof: Suppose Upd^{Pr,B) = {Pr'} and Pr'(A) ^ 
Pt{A\B) for < Pt{A\B) < 1. Then either Pr'(A) < 
Pt{A.\B) or Pt'{B - A) < Pr(B - A\B). But this 
contradicts Proposition 4.3, because there is no cor- 
responding Pr". A separate, but simple, argument is 
needed when Pv{A\B) e {0,1}. If Pr{A\B) = 1, con- 
sider any disjoint Ai, A2 such that A ~ Ai U A2 and 

< Pt{Ai\B) < 1. (We can assume that such Ax^Ai 
exist, appealing to P2 if necessary.) But then 

Pr'(A) = Pr'(Ai)+Pr'(A2) 

= Pr(Ai|B) + Pr(A2|B) 
= Pr(^|B), 

where we use the fact that < Pr(yli|B), Pt{A'2\B) < 

1 and the previous argument. Finally, the case of 
Pr(^|i?) = follows by considering B — A. Thus 
Pr' = Pr{-\B). I 

It follows from Proposition 4.4 that P6' implies both 
P6" and P6* in the presence of PI and P2. 

The next step towards our result is to characterize 

Proposition 4.5: Suppose that Upd satisfies PI P5 

for some space M and 



and that Upd'^{Pv,B) = 
some B with Pr(B) ^ 0. Then Upd= fpc^constram- 

Proof: See the appendix. | 



Note that by P4, we cannot have Pr(_B) = 1 for the 
set B in Proposition 4.5. Thus, Proposition 4.5 tells us 
that if Upd satisfies P1-P6 (in fact, even if it satisfies 
just P1-P5) and docs not satisfy P7, then it must be 
^P'^constrain- remains to show that if Upd satisfies 
P7 as well as P1-P6, then it must be Upd^^^ii- 

In the case that Upd satisfies P1-P5, P6', and P7, 

it follows from Proposition 4.5 that wc must have 
\Upd^{Pr,B)\ = 1 if Pr(B) ^ 0. The fact that 
Upd = Upd^g^^ now follows immediately from Propo- 
sition 4.4. Thus, it remains to show that if if Upd 
satisfies PI P5, P6", and P7, then Upd — Upd^^^^. 

To show this, we first prove an easy lemma, which 
shows that Upd must agree with conditioning at least 
on events with extreme probabilities. 

Lemma 4.6: // Upd satisfies PI, PS, and P4, Pr S 

Am, AC B, and Pr(^|B) = 1 (resp., Pt{A\B) = 0), 
then for all Pr' G Upd-^{Pr,B), we have Pv'{A) = 1 
(resp., Pr'{A) = 0). 

Proof: Let M = {W,!F). It suffices to prove the result 
for Pr(A|i3) = 1, because the other case follows by 
considering B — A. 

Let C = W - {B - A). Since Pr{A\B) = 1, we have 
Pr(C) = 1. By P4, we have {Pr} = Upd-^(Pr,C), 
and thus Upd^{Pv,B) = Upd^{Upd-^{Pr,C),B). 
By P3, Upd^ ( Upd-^ (Pr, C),B) = Upd:^ (Pr, B n 
C) = Upd-^ {Pr, A). It follows that Upd'"^ (Pr, B) = 
Upd/^{PT:,A). But by PI, if Pr' € Upd/^{Pv,A), then 
Pr'(A) = 1. The result follows. | 

We now introduce a key concept. Given Upd, a mea- 
sure space M. = (W,T), a measure Pr e Ax, and 
events A C B e T such that < Pr(^) < Py{B) 
define 



U"P'^'-^-^'{A,B) 



sup 



Pv'{A) 



weUpdM{Vr,B) Pr(^)/Pr(B) 



(We take [/C^MM.Pr^^^ ^) ^ _^ jf [/^/^(Pr, b) = 0.) 
The intuition is that U^'P'^'^'^' {A, B) should be 
viewed as a measure of how far Upd is from Upd^^^^. It 
is the sup, over all the measures in Upd/^{Pv, B), of the 
ratio of the (post- update) probability of A to Py{A\B). 
Of course, if Upd = Upd^^^^, then U^p'^'^''^'{A, B) = 
1 for all A, B. As the following result shows, the con- 
verse holds as well. 

Proposition 4.7 : // Upd satisfies PI-P4 and 
jjUpd.M,Pr(^j^^^^ = 1 for all M, A, and B, then 
Upd= Upd^^^^. 



Proof: Suppose that Upd ^ Upd^^^^. Then there 
exists a measure space M. = {W,J^), Pr e Ax, 



B & A C B, and Pr' e f/prf-^(Pr,B) such that 
Pr'(A) ^ Pr(74|B). By Lemma 4.6 we know that 
Vr{A\B) is neither nor 1. If Pr'(A) > Vr{A\B), then 
jjUpd.M.Pr^j^ j^-^ > 1, contradicting our assumption. 
But otherwise, we have Pr'(A) < Pr(^|i3) and so, by 
Proposition 4.3, there exists Pr" e Upd^{Pv,B) such 
that Pr"(A) > Pr(^|B). Again, this contradicts our 
assumption. | 

In Ught of Proposition 4.7, we can complete the proof 
of Theorem 4.1 by proving the following result: 

Proposition 4.8: // Upd satisfies P1-P5, P6" , and 
PI, then U^P'^'^'^'{A,B) = 1 for all M, A, B. 

Despite the strength of P6", the proof of Proposi- 
tion 4.8 turns out to be surprisingly difScult. (More 
accurately, we have not been able to find a proof that 
is not difficult!) The details are in the appendix. 



5 DISCUSSION 



The main purpose of this paper is to illustrate how 
different the set-of-measures model can be, technically, 
from the standard single-measure model. In general, 
it is unwise to simply assume that a result in the stan- 
dard model can be trivially "lifted" to apply in general. 

In terms of understanding update procedures, one ob- 
vious next step would be to examine some of the other 
justifications for conditioning in the single-measure 
model, to see to what extent they carry over to the new 
setting. There are also outstanding questions even in 
the axiomatic framework considered here. P6, in par- 
ticular, is quite a strong assumption. Is it really nec- 
essary? We conjecture that our main result actually 
holds with P6 replaced by P6*, although we have not 
been able to prove this. (Recall that P6' implies both 
P6" and P6* in the presence of PI and P2.) 

We are clearly not advocating P6 (or even P6*) as be- 
ing anywhere near as compelling as, say. PI or P2. (Of 
course, one can raise reasonable arguments against PI 
and P2 — and most of the other postulates — as well.) 
So what does this say about Upd^^.^^^ and Upd^g.^^^^.^^^^? 
It seems quite plausible to us that other update pro- 
cedures, such as Updfji^, that will be appropriate in 
some circumstances. We believe that a more careful 
investigation into such alternative rules, and a contin- 
ued effort to try and clearly determine what makes an 
update rule appropriate to a given domain, would be 
worthwhile. 



A APPENDIX: PROOFS 

For the proofs, it is useful to define the family of spaces 
A^„ = ({!,..., n},2{i.-'"}). 

Proposition 4.3: Suppose that Upd satisfies PI and 

P2 and that for some Pr' € Upd/^ {Pi, B) and A such 
that < Pr(A|B) < 1, we have Pi {A) < Pt{A\B). 
Then there also exists some Pr" e Upd^{Pr,B) such 
that Pv"{A) > Pi{A\B). 

Proof: Consider M = {W, T) and assume that B ^ 
W. (The argument \i B = W is almost identical and 
left to the reader.) By P2, it suffices to prove the result 
in the case that M. = Mz- (For any other M., we can 
consider the surjection that maps Ato 1, B — Ato2, 
and W — B io^ and then appeal to P2.) 

There are now two cases to consider. First, assume 
that Pr:{A\B) < 1 is rational, say m/n. Now consider 
the space M.n+i- Let A = {1,. . . ,m}, B = {1, . . . , n}, 
and Pr give each of 1 , . . . , n equal probability. We can 
consider the surjection g : Wn+i W3 that maps 
1, . . . , m to 1, m + 1, . . . , n to 2, and n + 1 to 3. Thus, 
by P2 again, the result is true for A^3 if we can show 
that it is true for an arbitrary measure Pr on A^„+i 
and Pr' e Upd'^''+^ (Pr, B). Note that the reason we 
have to introduce both Ais and Ain+i is that there 
is always a mapping from any other M for which the 
proposition is relevant into the former space, but not 
necessarily into the latter. 

Clearly Pr' does not give equal probability to each 
of the points l,...,n (for if it did, we would have 
Pt'{A) = m/n). In fact, the average probability of 
a point in A (according to Pr') is 1/n — e/m, where 
e — Pr{A\B) — Pi'{A). There are two cases to con- 
sider: If Pv{A\B) > 1/2, then n - m < m. In this 
case, let C consist of the n — m elements of A with the 
lowest probability (according to Pr'). We must have 
Pr'(C) < {n — m)/n — {n — m)e/m. Let ft be a permu- 
tation on{l,...,n-|-l} that switches the points in C 
with the points in B ~ A such that h(n + 1) — n + 1. 
Note that h*{Pr) — Pr (since Pr gives equal prob- 
ability to all the points in B). Let Pr" = /i*(Pr'). 
Note that Pi"{B -A) < (n-m)/n- {{n - m)/n)e, 
so Pr"(^) > m/n + ((n - m)/n)e. If Pi{A\B) < 1/2, 
then n — m > m, and a similar argument works: This 
time, let C consist of the m elements of A with the 
lowest probability. In this case we get that Pr"(^) > 
rn/n + {rn/n)e. Since ft*(Pr') G Upd^ {Pi, B) by P2, 
we are done. Note that, in either case. Pi" [A) > 
Pr{A\B) + min{Pr{A\B), 1 - Pr{A\B))e. 

Next suppose that Pr(j4|_B) is irrational. Choose r ra- 
tional such that Pi{A\B) > r > Pi{A\B) - min(r, 1 - 
r)e/2. By using P2 again, it suffices to prove the 



result for M4, with A = {1,2}, B = {1,2,3}, and 
Pr({l}|B) = r. Let A' = {1}. Since Pr'(A') < 
Pr'{A) = Pr{A\B)-e, it follows that Pr'{A') < r-e/2. 
Since Pr(A'|_B) is rational by construction, by the pre- 
vious argument, there exists Pr" G Upd^(PT, B) such 
that Pr"(^') >r + min(r, 1 - r)e/2 > Pt{A\B). Since 
A' C A, we have Pi" {A) > Pr {A\B), as desired. | 

Proposition 4.5: Suppose that Upd satisfies PI- 
PS and that Upd^ (Pr, B) = % for some space M. and 

Pv{B) ^ 0. Then Upd= C/Kon«tram- 

We prove this using Lemmas A.1-A.3. 

Lemma A.l: // Upd satisfies P2, Upd^{PY,B) = 0, 
and Pr(i?) = a < 1, then for every space M! , prob- 
ability measure Pr' e Am', and B' e J^' such that 
Pr'(B') = a, we have Updh' {Pv' ,B') = 0. 

Proof: First assume that M' = M2 B' = {2}. Then 

the result is immediate using P2. (Consider the A4 
M' representation shift that maps B to {2} and B to 
{1}.) Now if A^' is arbitrary, we again get the result 
by applying P2 and using the fact that it holds for Al2- 
(Again, consider the M'-M2 representation shift that 
maps B' to {2} and B to {1}.) | 

We can bootstrap our way up to an even stronger 
lemma. 

Lemma A.2 : // Upd satisfies P2 and PS, and 
Upd-^{Pi,B) = for some B with Pr{B) = a < 1, 

then for every domain M.' , probability measure Pr' €E 
A^/, and B' C W' such that Pv'{B') < a, we have 
Upd^'{Pv',B') = 9. 

Proof: First consider the space M3 and let Pr" 
be a measure such that Pr"({2,3}) = a and 

Pr"({3}) = /3 < a. By Lemma A.l, we have that 
Upd^-'iPi", {2, 3}) = 0. By P3, we have that 

Upd^^{Pv",{2}) = Upd^^{{Upd^%Pv",{2,3}),{2})) 
= [/prf-^^(0,{2}) = 0. 

By Lemma A.l again, it follows that for every space 
M' = {W',T'), probability measure Pr' e Am', 
and B' G T' such that Pr'(B') = /3, we have 
Upd-^' {Pr , B') = 0. Since P was arbitrary, the de- 
sired result follows. | 

Finally, we get the strongest possible result of this 
type. 

Lemma A.3: // Upd satisfies PI, P2, PS, and P5, 
Upd^{Pi,B) = 0, then Upd^' {Pi' , B') = for every 



domain M.' , probability measure Pr' € A_m.i , and set 
B' such that Pr'(S') < 1. 

Proof: Let 7* = sup^, B{Pr'(S') : Upd^'{Pr',B) ^ 
0}. We want to show that 7* = 1. Suppose by way 
of contradiction that 7* < 1. Choose e > such that 
< 7* - e, 7* + e < 1, and (7* - e)/(7* + e) > 7*. 
Consider the space again and let Pr' be such that 
Pr'(2) = 7* - e and Pr'(3) = 2e. By choice of 7* and 
Lemma A.2, we have that Upd'^'' {Pr , {2}) = and 
Upd^'{Pv', {2,3}) ^ 0. By P3, we have 

Upd:^^{{Upd^^{Pr',{2,3}),{2}) 

= C/pd-^^(Pr',{2}) =0. ^ ' 

However, since Upd^^ {Pv' ,{2,3}) 7^ 0, by Proposi- 
tion 4.3, there exists some Pr" G /Tpc?"^'^ (Pr', {2, 3}) 
such that Pr"(2) > Pr'({2}|{2, 3}) = (7* - e)/(7* + 
e) > 7*. By Lemma A.2 and the choice of 
7*, we have that Upd-^' {Pr" , {2}) ^ 0. By P5, 
[/pd-^^(Pr",{2}) C Upd^^{{Upd^^{Px' ,{2,3]),{2}). 
This contradicts (2). Thus, we must have 7* = 1. | 

It follows from Lemmas A.2 and A.3 
that Upd^(Pr,S) = if Pr(B) < 1. Proposition 4.5 
now follows immediately from P4 and P5. | 

This completes the proof of Proposition 4.5. In order 
to prove Theorem 4.1, it remains only to prove Propo- 
sition 4.8. We first need a few preliminary lemmas. 
The first shows that the value of [/C^MM,Pr(^^ 5) de- 
pends only on Upd and the values Px{A) and Pr(B), 
but otherwise docs not depend on the details of M. or 
the exact identity of A and B. 

Lemma A. 4: // Upd satisfies P1-P5 and P7 there 
is a function V^P'^{x,y) defined for < x < y < 1 
such that U^P'^'^'^'{A,B) = V"p'^{Pi{A),Pt{B)) for 
allACBeJ^ with Pr{A) > 0. 

Proof: For < a; < y < 1, let Pr^^^ G A-^^ be 
defined so that Prx^y{l) = x and Pr^_y(2) = y — x (so 
that Pra,_y({l, 2}) = y). Finally, define 

V^P'^{x,y) = 

Jl ify = lora; = y 

\ [7^P'*'^='P^«.'' ({!},{!, 2}) ifO<a;<y<l. 

We must show that this definition has the required 
properties. Consider any M.,A,B,Px such that A C 
B. If Pr(A) < Pr(i?) < 1, then it is immediate 
from P2 that U^v'^'^'^''{A,B) = V^i"^{Pt{A),Pt[B)). 
If Pt{A) = Pr{B), then Lemma 4.6 assures us that 
if Pr' G Upd^{PT,B) then Pr'(A) = 1 and hence 
jjUpd,M,Pr(^j^^ B) = 1. Thus, the result follows as long 



as Upd (Pr, B) is nonempty. But this follows from 
Proposition 4.5 and P7. Finally, if Pr(i?) = 1, the 
result is immediate from P4.^'^ | 

Lemma A. 5: // Upd satisfies PI P5 and P7, then for 
any fixed y, V^^'^{x, y) is a non-increasing function of 
X such that V^P'^{y,y) = 1. 

Proof: See the full paper. | 

We need one more, rather technical, lemma. Our over- 
all goal is to show that Upd^(Pv,C) = {Pr(-|C)}. 
But perhaps it is reasonable to first ask a weaker 
question: is it true that Pr(-|C) € f/pd-^(Pr, C)? 
While the following lemma does not quite show this, 
it proves something in a very similar spirit. In partic- 
ular, the lemma implies that, for every event B d C, 
there is some measure Pr"^ G Upd'^{Pi:,C) such that 
Pr^(B) = Pr(B|C). Note, however, that Pr^ may de- 
pend on B and so this docs not prove that Pr(-[C) G 
Upd''^ (Pr, C). On the other hand, the lemma also does 
something more than just assert the existence of Pr^ . 
Consider another event A disjoint from B and any 
Pr' G Upd-^{FT,C). Of course, Pt'{A) does not neces- 
sarily equal Pr(A|C). But the lemma shows that there 
is another measure Pr" G Upd^ (Pr, C) that agrees 
with Pr' on A and "looks like conditioning" outside A, 
at least with respect to B. More precisely, Pr"(_B) is 
exactly the final probability oi C — A (i.e., 1 — Pr'(j4)) 
times the prior probability Pr(i?|C — A). Note that if 
A = $ this reduces to the earlier claim (since we can 
then take Pr^ = Pr"). 

Lemma A. 6: Suppose that Upd satisfies PI, P2, P0' . 

For all A,B C C C W such that A and B are dis- 
joint elements of T , Pr G A^, Vt(B) > 0, and 
Pr' G Upd^ {Pi, C), there exists Pr" G Upd^{Pr,C) 
such that Pt"{A) = Pr{A) and Pr"{B) = (1 - 
Pt'{A)) Pi{B\C-A). 

Proof: See the full paper. | 

This omitted proof is rather complex. Unlike the 

other results in this section, which involved only fi- 
nite spaces, it makes crucial of an uncountable space. 
More precisely, our proof makes crucial use of P2 ap- 
plied to representation shifts involving an uncountable 
space (the unit interval with Lebesgue measure). We 
conjecture that the result would actually be false if 
we restrict P2 to representation shifts involving only 
countable spaces. Of course, our main theorem might 
still be true even if we restrict P2 to countable spaces; 

^"We remark that the only place we use P3 in this proof 
is in the appeal to Lemma 4.6. With a little more effort, 
the result can be proved even without P3. 



it might have a different proof that does not rely on 
Lemma A. 6. 

Another point worth noting is that Lemma A. 6 can 
be proved using the weaker postulate P6* rather than 
P6". Unfortunately, this docs not seem to be true for 
Proposition 4.8, which we are finally ready to prove. 

Proposition 4.8: // Upd satisfies P1-P5, P6" , and 
PI, then uVvd,M,V'^(^A, B) = 1 for all M, A, B. 

Proof: By Lemma A. 5, we see that V^P'''{x,y) is 
bounded below by 1 (and in fact the bound is attained 
when x = y). On the other hand, by P6", there is 
also a finite upper bound c. In fact, we can assume 
that c is the least upper bound. Suppose by way of 
contradiction that c > 1. 

The basic idea of the proof is to use P3 and show that 
there is a sequence of two iterated updates in which 
some event's probability grows by more than c (relative 
to what we would expect from conditioning). We also 
show (using P3) that there is a single update which 
would give the same result. However, this contradicts 
the definition of c. 

Consider c' = {c-\- \/c)/2; note that c'^ > c but 1 < 
c' < c. In particular, by the definition of c and since 
c' < c we can find some .r, z such that U^'"^(x, z) > c'. 
Furthermore, by the monotonicity of V^^"^, we also 
have V^P'''{x', z) > c' for x' < x. In the following, con- 
sider any x' < x such that cx' / z < x. Now consider 
the space M4. Let A = {1}, B = {1, 2}, C = {1, 2, 3}, 
and let Pr be a measure such that Pr{A) = x' and 
Pr(C) = z. This does not completely specify Pr; in 
particular, the only constraint so far on Py{B) is that 
it lies between x' and z. Note, however, that the set 
{r : Pi {A) = r for some Pr' G Upd.-^'{Pv.C)} is in- 
dependent of the exact probability we give to B, be- 
cause of postulate P2. In the following, choose some 
value r from this set such that r > c'x'/z; the defi- 
nition of V^P"^ guarantees that such an r must exist, 
since V^p'''{x',z) > d . Note also that r < cx' jz by 
definition of c. Thus, r < a;, by choice of x' . 

We now complete the specification of Pr by defining 
Pr(2) = [z — r){z — x')/(l — r). It is easily verified 
that Pr({l,2}) is then between x' and z. Moreover, 
Pr({2}|{2,3}) is (z - r)/(l - r) (since Pr({2,3) = 
Pr(C — A) = z — x'). Therefore, by Lemma A. 6, 
there is some Pr' G Upd^"" {Pi, C) such that not only 
is Pi' {A) = r but furthermore 

Pi'{B) = Pr'(l)-hPr'(2) = r-\-{l-r){z-r)/{l-r) = z. 

However, now consider C/pd'^* (Pr', B). We know that 
Pi'{A) = r < cx' jz < X and Pi'{B) = z. Therefore 



(by the monotonicity of V ^"'p'^) wc have V ^^^''(r, z) > c' . 
Thus, there must be some Pr" G f/pd"^* (Pr', i?) such 
that Pr"{A) > c'r/z > c'^x'/z^. 

By P3, we know that conditioning on C then B must 

give the same rcsiilt as if wc were to condition directly 
on B. Thus, Upd^^iPi:, B) must also contain Pr". 
It follows that V^p'^{x',Ft{B)) > Pr" {A)Pr{B)/x' > 
c'^Pr(B)/z2. Recall that c'^ > c. Wc thus will have 
a contradiction with the definition of c if we can show 
there exists x' such that Pt{B) /z^ is close enough to 1 
so that c'^Pr(_B)/z^ exceeds c. But, in fact, we know 
that Pr(S) = x' + {z - r){z - x')/{l - r). It is clear 
that by choosing x' (and hence also r) to be sufficiently 
small, we can make this as close to 1 as we wish. The 
result follows. | 
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